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Abstract 

The Eastern Himalayan region of Northeast (NE) India is home to a large number of indigenous rice varieties, which 
may serve as a valuable genetic resource for future crop improvement to meet the ever-increasing demand for 
food production. However, these varieties are rapidly being lost due to changes in land-use and agricultural 
practices, which favor agronomically improved varieties. A detailed understanding of the genetic structure and 
diversity of indigenous rice varieties is crucial for efficient utilization of rice genetic resources and for developing 
suitable conservation strategies. To explore the genetic structure and diversity of rice varieties in NE India, we 
genotyped 300 individuals of 24 indigenous rice varieties representing sali, bom, jum and glutinous types, 5 
agronomically improved varieties, and one wild rice species (0. rufipogon) using seven SSR markers. A total of 85 
alleles and a very high level of gene diversity (0.776) were detected among the indigenous rice varieties of the 
region. Considerable level of genetic variation was found within indigenous varieties whereas improved varieties 
were monoporphic across all loci. The comparison of genetic diversity among different types of rice revealed that 
sali type possessed the highest gene diversity (0.747) followed by jum (0.627), glutinous (0.602) and boro (0.596) 
types of indigenous rice varieties, while the lowest diversity was detected in agronomically improved varieties 
(0.459). The AMOVA results showed that 66% of the variation was distributed among varieties indicating a very high 
level of genetic differentiation in rice varieties in the region. Two major genetically defined clusters corresponding 
to indica and japonica groups were detected in rice varieties of the region. Overall, traditionally cultivated 
indigenous rice varieties in NE India showed high levels of genetic diversity comparable to levels of genetic 
diversity reported from wild rice populations in various parts of the world. The efforts for conservation of rice 
germplasm in NE India should consider saving rice varieties representing different types with specific emphasis 
given to sali and jum types. The protection against the loss of vast genetic diversity found in indigenous rice 
varieties in NE India is crucial for maintaining future food security in the changing world. 
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Introduction 

The Asian cultivated rice (Oryza sativa L.) is one of the 
most important crops and a major food source for more 
than half of the global human population. Phylogeogra- 
phical and archeological evidence suggest that rice was 
domesticated over 10000 years ago from its wild ances- 
tor O. rufipogon in the region south of the Himalayan 
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mountain range, likely in the present day Eastern and 
NE India, extending Eastward to Nepal, Myanmar and 
Thailand to Southern China (Chang 1976; Khush 1997; 
Londo et al. 2006). A recent study suggests that one of 
the two sub-species of Asian rice, O. sativa ssp indica 
was domesticated in Southeast and South Asia while the 
other sub-species, O. sativa ssp japonica was domesti- 
cated in Southern China (Huang et al. 2012). During the 
domestication process, individuals with desirable traits 
have been selected leaving most of the genetic diversity 
behind in the progenitors (Doebley et al. 2006). Zhu 
et al. (2007) estimated that the cultivated rice contains 
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only about 25% of the genetic diversity found in its wild 
progenitors depicting severe genetic erosion during do- 
mestication. Furthermore, a considerable level of genetic 
diversity was lost during the agronomic improvement of 
commonly cultivated rice. 

Studies have shown that indigenous crop varieties 
traditionally cultivated and maintained by farmers con- 
tain high level of genetic diversity and can serve as po- 
tential genetic resources for improving yield, resistance 
to pests and pathogens, and agronomic performance 
(Brush 1995; Hoisington et al. 1999; Mandel et al. 2011). 
The Eastern Himalayan region of NE India, a geogra- 
phical area of over 255,000 km consisting of Arunachal 
Pradesh, Assam, Manipur, Meghalaya, Mizoram, Nagaland 
and Tripura states (Figure 1), is home to a large number 
of indigenous rice varieties. These varieties are cultivated 
in diverse topographic and agroclimatic conditions, and 
normally classified into different types based on the season 
of cultivation, habitat conditions and the grain quality. 

The sali type, which comprises majority of rice var- 
ieties of the region is cultivated in low-lying flood plains 
of NE India, mainly in the Brahmaputra and Barak 
Valley regions. The bow type is traditionally cultivated 
during the winter months (November through May) in 
low-lying areas where sufficient water is available during 
the cold and dry months of the year. Thus, bow type 
rice varieties may contain genotypes suitable for cold 



adaptation. The dryland cultivated rice varieties, nor- 
mally grown in slash and burn agriculture system, and 
locally known as jum type, show adaptations to a wide 
range of ecological conditions including low levels of soil 
moisture in areas at high altitudes reaching over 3000 m 
above sea level. The glutinous grain type rice is com- 
monly cultivated throughout the region as a source of 
grain for breakfast and dessert for many ethnic commu- 
nities in the region. In addition to cultivated indigenous 
rice varieties, natural populations of many wild rice spe- 
cies including O. rufipogon, O. granulata, O. officinalis, 
O. nivara, O. meyeriana, Hygwrhiza aristata, Leersia 
hexandra and Zizenia latifolia are also found in the 
Northeastern region of India (Hore 2005). 

The indigenous rice varieties cultivated by traditional 
farmers may contain a considerable genetic diversity that 
can serve as a source of germplasm for genetic improve- 
ments of cultivated varieties of rice. In general, diverse 
landraces traditionally cultivated by farmers around the 
centers of diversity and domestication of crops are con- 
sidered as key natural resources (Pusadee et al. 2009) 
important for maintaining the future food security in 
light of the changing climate. Although a few studies 
have examined the population genetic structure of O. 
sativa germplasm at a global scale (Glaszmann 1987; 
Garris et al. 2005), region specific studies are limited. 
Earlier studies based on morphology and agronomic 
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Figure 1 Map of Northeast India showing sampling sites of traditionally cultivated indigenous rice varieties. 
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traits (Vairavan et al. 1973; Borkakati et al. 2000; Sarma 
and Pattanayak 2009) as well as molecular markers (iso- 
zyme, RAPD, ISSR) demonstrated a high level of genetic 
diversity among indigenous rice varieties in NE India 
(Glaszmann et al. 1989; Sarma and Bahar 2005; Bhuyan 
et al. 2007). However, these studies were limited either 
to a particular group of varieties {e.g. glutinous rice and 
lowland varieties) or to a narrow geographic region. In 
particular, no extensive studies have focused on the gen- 
etic structure of some of the widely cultivated indigen- 
ous types such as boro (cultivated in low-lying perennial 
water bodies during winter season), jum (cultivated in 
upland areas in hill-slopes and low soil moisture condi- 
tion), sail (most widely cultivated rice during monsoon 
season) and glutinous (sticky rice with cultural impor- 
tance) covering the wider geographic area. 

The ongoing rapid changes in agricultural practices 
that favor agronomically improved varieties has become 
a serious threat for the persistence of indigenous rice 
varieties in NE India. Thus, conservation and manage- 
ment strategies are urgently needed to prevent further 
loss of genetic diversity inherent to indigenous rice var- 
ieties in the region. A detailed understanding of the ge- 
netic structure and diversity is needed for the planning 
and implementation of effective conservation, manage- 
ment and utilization of rice germplasm in the whole 
region. 

The objectives of the present study are to (1) assess 
genetic diversity among indigenous rice varieties in the 
Eastern Himalayan region of the NE India, (2) compare 
the genetic diversity in indigenous varieties with agro- 
nomically improved varieties (3) assess distribution of 
genetic diversity among different types and (4) infer 
the population genetic structure of rice varieties in 
NE India. 

Materials and methods 

Plant samples 

A total of 29 varieties of cultivated rice {Oryza sativa) 
were collected from various regions of NE India (Figure 1). 
These samples included 24 indigenous varieties represen- 
ting sail (12), jum (4), boro (3), and glutinous (5) types 
and 5 agronomically improved varieties. The variety 
name, type and locality are given in Table 1. Wild rice 
(O. rufipogon) accessions originally collected from Eastern 
India were obtained from the International Rice Research 
Institute (IRRI), Philippines. Either grains or fresh leaf 
samples were collected from the field and morphological 
characters were noted based on direct observation or in- 
terviewing the farmers. The agronomically improved var- 
ieties, released by the regional and central rice research 
institutes and widely cultivated for their higher yield were 
obtained from farmers of the region. Seeds were germi- 
nated in Petri dishes and transferred to small pots and 



grown in a greenhouse. Leaf samples from seedlings were 
harvested, air dried, and used for the study. Genomic 
DNA was extracted following a modified cetyltrimethyl 
ammonium bromide extraction protocol (Doyle and Doyle 
1987; Dayanandan et al. 1997). 

PCR assay and genotyping 

Seven SSR loci (RM302, RM341, RM130, RM307, 
RM169, RM204, RM264) with relatively high poly- 
morphism and distributed across the rice genome were 
selected for the genetic diversity analyses (Table 2) 
(Chen et al. 1997; Temnykh et al. 2000). The forward 
primers were labeled with IRD700 or IRD800 dye for 
genotyping in LI-COR 4000 IR2 DNA analyzer (Li-Cor 
Biosciences, Lincoln, NE). The PCR amplifications were 
performed in 25 uL reaction mixture consisting of 
0.2 mM dNTP, 2.5 mM MgCl 2 , 2.5 uL of lOx buffer, 2.5 
pmol of each primer and 0.2 U Taq polymerase. The 
thermocycling profile used was initial denaturation at 
94° (3 min) followed by 35 cycles of 94° (2 min), 50° 
(1 min), 72° (2 min) and a final extension of 72° for 
5 min. The amplified products were diluted (1:50) with 
loading dye (Formamide and Bromophenol blue), dena- 
tured at 94°C for 5 min and cooled on ice before loading 
to 6.0% denaturing polyacrylamide gels on a Li-COR auto- 
mated DNA sequencer with a size standard (50-350 bp, 
IRDye700 or IRD-800) (Li-Cor Biosciences). 

The size of each amplified fragment was determined 
by comparison with the size standard and scored to pre- 
pare the genotype matrix. To determine the optimum 
number of individuals per variety to be genotyped to 
capture the total diversity, the number of individuals an- 
alyzed were increased one by one until the number of al- 
leles reached to a maximum with no further increase for 
a given locus. Accordingly, we determined that 10 in- 
dividuals per variety was sufficient to capture the total 
genetic variation in a given variety. Therefore, we geno- 
typed 300 individuals (10 individuals per variety for 30 
varieties) at seven SSR loci for the present study. 

Data analysis 

The SSR genotype data matrix was used for assessing 
genetic diversity and structure in a hierarchical manner 
from overall (all indigenous varieties), through different 
types and each variety. The among type genetic diversity 
was calculated by considering all genotyped individuals 
of a given type as one population while genetic para- 
meters for among variety was calculated based on 10 ge- 
notyped individuals per variety. The observed average 
number of alleles per locus (A/a), average allelic richness 
(Rs), population differentiation (Fst) and Nei gene diver- 
sity (He) (Nei 1973) were calculated using FSTAT 2.9.2.3 
(Goudet 2001). Allelic richness is the number of alleles 
for each population averaged over loci and standardized 
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Table 1 Cultivation type, location and genetic diversity values of traditionally cultivated indigenous and agronomically 
improved rice varieties including the wild rice (O. rufipogon) in Northeast India (AP, Arunachal Pradesh; AS, Assam, 
ML, Meghalaya, MZ, Mizoram) 
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A = Observed no. of allele; Na = Average no. of alleles per 7 loci; Npo = No. of polymorphic loci; Npe = Percent polymorphic loci; R A = Rare allele; I = Shannon 
information index; He = Nei gene diversity. 



Table 2 Details of SSR loci used in the present study and their genetic diversity parameters 
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Chr, Chromosome location; Na, Observed number of alleles; He, Nei (1973) genetic diversity. 
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for the smallest population size. Average effective num- 
ber of alleles (A/e) and Shannon information index (L) 
were calculated using PopGene version 1.31 (Yeh et al. 
1999). Average pairwise genetic differences between var- 
ieties was calculated using Arlequin 3.5 (Excoffier and 
Lischer 2010). Analysis of Molecular Variance (AMOVA) 
(Excoffier et al. 1992) within variety, among variety and 
among types was performed in Arlequin 3.5 (Excoffier 
and Lischer 2010) to determine the distribution of var- 
iation at different hierarchical levels. The statistical sig- 
nificance of the variance components was tested with 
1000 permutations. 

Genetic distance among varieties were estimated using 
chord genetic distance method (Cavalli-Sforza and 
Edwards 1967). The genetic distance based clustering 
was performed with the unweighted pairgroup method 
with arithmetic mean (UPGMA) using PowerMarker 
v3.25 (Liu and Muse 2005), and the dendrogram was 
constructed using MEGA software (Kumar et al. 2001). 
Principal component analysis (PCA) of pairwise gen- 
etic distance between individuals was performed using 
GenALEx v. 6.4 (Peakall and Smouse 2006). The Bayesian 
model-based clustering analysis was used for determining 
the optimal number of genetic clusters found among rice 
varieties using the software STRUCTURE 2.3.3 (Pritchard 
et al. 2000), which partitions individuals into number of 
clusters (K) based on the multilocus genotypic data. The 
admixture model and correlated allele frequencies were 
applied for each run with 10,000 burn-in period (iteration) 
and 100,000 Markov Chain Monte Carlo (MCMC) repli- 
cation. The optimum A" value, which indicates the number 
of genetically distinct clusters in the data, was determined 
from 10 replicate runs for each value of K (Evanno et al. 
2005). The AK was based on the change in the log prob- 
ability of the data between successive K values. Software 
program Structure Harvester v6.0 (Earl and von Holdt, 
2011) was used for calculating parameters of Evanno et al. 
(2005). The results of five independent runs were consist- 
ently converged to the same values. 

Results 

Overall microsatellite diversity 

The seven selected SSR loci amplified DNA fragments 
from 29 O. sativa varieties and O. rufipogon with con- 
sistent reproducibility. A total of 96 alleles with an ave- 
rage of 13.57 alleles per locus were detected among all 
studied samples. The highest number of alleles (21) was 
detected in the locus RM264 and the lowest (4) was in 
the locus RM130. The indigenous rice varieties were 
genetically variable, while agronomically improved var- 
ieties were monomorphic within varieties at all loci. The 
highest gene diversity value of 0.884 was detected at 
RM264 and the lowest value of 0.419 detected in RM130 
(Table 2). 



Indigenous rice varieties in NE India showed high level 
of genetic diversity with an overall allelic richness of 
10.205 per locus and a gene diversity value of 0.776, 
while the agronomically improved varieties had signifi- 
cantly lower average allelic richness of 2.857 per locus 
and gene diversity was 0.459. A very high level of differ- 
entiation (Fsx = 0.754) was also detected among the rice 
varieties. 

Within variety genetic diversity 

The average observed number of alleles among indi- 
genous rice varieties ranged from 1.14 (Joha and Bhe- 
rapawa) to 3.29 (Lallatoi) while the corresponding value 
was only 1.00 for the agronomically improved varieties. 
Some of the elite traditional rice varieties (including 
Lallatoi, Mulahail, Aubalam, Mimutim) showed very 
high levels of genetic diversity as measured in average 
numbers of alleles, rare alleles and Nei gene diversity. 
Two of those varieties exhibited relatively high numbers 
of rare alleles {Lallatoi = 4; Mimutim = 3). Locus wise 
allele frequencies are presented in Additional file 1: 
Table SI. Nei's gene diversity values ranged from 0.051 
(Bherapawa) to 0.498 (Lallatoi) with an average of 
0.223 across all indigenous varieties. Shannon informa- 
tion content varied widely across varieties from 0.072 
(Bherapawa) to 0.854 (Lallatoi) and the average was 
0.338 across varieties. The diversity parameters across 
varieties are presented in Table 1. The pairwise genetic 
differentiation among varieties (Fst) ranged from 0.375 
to 1.000 and highly significant (p < 0.001). The pairwise 
-Fst values are given in Additional file 2: Table S2. 

Genetic diversity among types 

Different levels of genetic variation were observed in dif- 
ferent types of indigenous rice from NE India. The 
highest diversity was detected among the sali type with 
an average allelic richness and gene diversity of 7.585 
(±3.604) and 0.747 (±0.127) respectively. The next level 
of genetic diversity was detected among the jum type 
followed by the glutinous and bow types (Table 3). On 
the other hand, agronomically improved types showed 
the lowest levels of diversity (average allelic richness 
2.798 ± 1.438; average gene diversity 0.459 ± 0.251). All 
types showed very high inbreeding coefficient ranging 
from 0.936 to 1.000, which could be attributable to the 
selfing mating system of the cultivated rice. Among indi- 
genous rice varieties, the highest average gene diversity 
within type (/^s(w)) was observed in jum (0.259) and the 
lowest was in glutinous type (0.189). Population differen- 
tiation study within different types showed very low Fst 
values ranging from 0.023 in sali type to 0.036 in bow 
type (Table 3). The AMOVA results showed statistically 
significant differentiation (p < 0.01) with 25% variation 
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Table 3 Population structure and F-statistics of different types of indigenous and agronomically improved rice 
varieties in NE India 



Type 


Allelic richness 


Gene diversity 


Inbreeding coefficient 




F ST(W) 


Sali 


7.585 (3.604) 


0.747 (0.127) 


0.984 


0.222 


0.023 


Jum 


5.056 (3.061) 


0.627 (0.187) 


1.000 


0.259 


0.032 


Glutinous 


4.727 (1.901) 


0.602 (0.261) 


0.936 


0.189 


0.029 


Boro 


3.857 (1.864) 


0.596 (0.280) 


0.980 


0.241 


0.036 


mproved 


2.798 (1.438) 


0.459 (0.251) 


1.000 


0 


0.029 



Allelic richness is based on minimum sample size of 30 diploid individuals. H S(W ) = average genetic diversity within type; F ST (w) = genetic differentiation within type. 
Values in parenthesis represent standard deviation. 



among individuals, 66% among varieties and 9% among 
cultivation types (Table 4). 

Genetic structure analysis 

The UPGMA clustering based on chord genetic distance 
grouped rice varieties into two distinct groups (Figure 2). 
The Group-I in the UPGMA tree consists of both indi- 
genous and the agronomically improved varieties. All ag- 
ronomically improved varieties clustered within Group-I, 
which could be considered as indica sub-species. The 
other group (Group-II) consisted of a few indigenous 
varieties belonging to sali and jum types and could be 
considered as the japonica sub-species. O. rufipogon ac- 
cessions appeared intermediate between indica and japo- 
nica groups (Figure 2). This analysis revealed that 62.5% 
of the traditional rice varieties in Eastern Himalayan 
region of NE India are of sub-species indica while 
37.5% are japonica sub-species. 

The UPGMA tree revealed that rice varieties clustered 
into smaller sub-groups based on type, grain qualities or 
geographic origin. For example, boro, jum, glutinous, 
and agronomically improved varieties clustered together 
into smaller sub-groups within Group-I {indica) while 
the Group-II {japonica) formed two sub-groups cor- 
responding the geographic locations (Additional file 3: 
Figure SI). A few sub-groups and varieties (marked with 
double asterisk), however, did not cluster with respective 
types or grain quality (Additional file 3: Figure SI). 

The PCA analysis using pairwise genetic distances re- 
vealed that the first three principal components explai- 
ned 59.91% of the total variation and showed similar 
clustering of rice varieties into Group-I {indica) and 
Group-II {japonica) (Figure 3). Three of the agronomi- 
cally improved varieties {Pankaj, Bahadur and Ranjit) 



Table 4 Analysis of molecular variance (AMOVA) based 
on 7 SSR loci of traditional and agronomically improved 
rice varieties in Northeast India 



Amova analysis 


df 


SS 


MS 


% of variation 


P-value 


Among type 


4 


294.45 


1 29.78 


8 


>0.001 


Among varieties 


24 


912.96 


76.54 


66 


>0.001 


Within varieties 


270 


366.05 


2.80 


26 


>0.001 



df, degree of freedom; SS, sum of square; MS, Means of square. 



formed a distinct group but showed closer affinity to the 
Group-I {indica). O. rufipogon accessions showed in- 
termediate position between the two groups (Figure 3) 
similar to clustering in the UPGMA tree. 

The Bayesian based analysis of population struc- 
ture showed that the highest log likelihood is at K= 2 
(Figure 4) suggesting two major groups corresponding 
to two distinct clusters. Individual assignments into two 
clusters revealed that Group-I (green color, Figure 5) 
consists of 34% of varieties and include sub-species ja- 
ponica with more than 95% ancestry. The other 52% of 
varieties including agronomically improved accessions 
formed Group-II (red color, Figure 5) corresponding to 
the sub-species indica with more than 95% ancestry. 
However, 14% of the indigenous varieties showed mixed 
ancestry of both indica and japonica types. The com- 
parison of STRUCTURE results with UPGMA and PCA 
results revealed that three varieties {Kawanglawang, 
Local Basmati and Bashful; varieties 3, 6, and 18 marked 
with asterisk; Additional file 3: Figure S2a) interchan- 
ged between Group-I {indica) and Group-II (japonica). 




2.0 

Figure 2 UPGMA tree based on chord genetic distance 
(Cavalli-Sforza and Edwards 1967) showing genetic relationships 
among 29 rice varieties in Northeast India. 
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Group-II (japonica) 




Figure 3 Principal component analysis of indigenous and agronomically improved rice varieties based on 7 SSR loci. Different varieties 
grouped together corresponding to two sub-species {indica and japonica). 



Discussion 

Genetic diversity 

The present study revealed exceptionally high genetic 
variation, with an average allelic richness of 10.205 and 
an overall Nei's gene diversity of 0.776 among indige- 
nous rice varieties in NE India as compared to signi- 
ficantly low average allelic richness (2.798) and gene 
diversity (0.459) in agronomically improved types. The 
levels of genetic diversity were also variable across differ- 
ent varieties and much higher than the agronomically im- 
proved varieties (Table 1). Although the varieties represent 
only a sub-set of total rice varieties in the region, the gene 
diversity detected is higher than the overall gene diversity 
of rice varieties reported from Yunnan province in China 
(0.706) (Tu et al. 2007) and Indonesia (0.68) (Thomson 
et al. 2007). The gene diversity detected in our study 
is comparable to the overall gene diversity of wild rice 
O. rufipogon (0.77) and O. nivara (0.64) populations of the 
Vientiane Plain of Laos (Kuroda et al. 2007) and the gene 
diversity of O. rufipogon in China (0.670) (Gao 2004). A 
previous study based on allozyme markers revealed a 
moderate genetic variability (Nei gene diversity = 0.341) 
among 289 rice varieties from NE India (Glaszmann et al. 
1989). The higher gene diversity values detected in the 



However, independent STRUCTURE runs without agro- 
nomically improved varieties grouped these varieties into 
the groups concordant with UPGMA and PCA analyses 
(Additional file 3: Figure S2b). Thus, the results of model 
based STRUCTURE analysis is in agreement with the 
UPGMA and PCA based clustering and grouping of rice 
varieties is consistent with the classification of indica and 
japonica types. 



t \ 




Figure 4 The relationship between AK and K showing the 
highest peak at K = 2. 
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Figure 5 Population structure of traditionally cultivated indigenous and agronomically improved rice varieties in the Eastern 
Himalayan region. The optimal value of K = 2. 



0.60 



0.40 



present study could be attributable to high resolving 
power of microsatellite markers. 

The present study revealed several indigenous rice var- 
ieties with high genetic diversity, which includes Lallatoi, 
Mulahail, Aubalam and Mimutim (Table 1). Despite 
the low yield, the traditional farmers in Hailakandi area 
(Barak Valley region of Assam) have been cultivating 
Lallatoi, Mulahail and Aubalam for over many genera- 
tions presumably for its superior nutritional quality and 
better taste (personal communication). The local tribal 
group members in the Garo Hills of Meghalaya pointed 
out the superior agronomical qualities of Mimutim. Our 
study revealed high genetic diversity in Mimutum, one of 
the highly valued rice varieties by native tribal groups. 
This reflects the importance of traditional knowledge in 
evaluation and conservation of indigenous crop genetic 
resources (Brush and Meng 1998). 

Most of the indigenous rice varieties are maintained 
and cultivated by traditional farmers in narrow geo- 
graphic regions. However, traditional farming practices 
are in decline due to preference for agronomically im- 
proved varieties for higher yield. Therefore, appropriate 
conservation measures should be taken to promote the 
cultivation of indigenous varieties with local traditional 
knowledge. 

The genetic diversity maintained in a species is con- 
sidered as a function of its ecological and evolutionary 
history (Hamrick and Godt 1996). The high genetic di- 
versity among NE Indian rice varieties have been de- 
scribed in relation to morpho-physiological characters 
(Vairavan et al. 1973), enzymatic characters (Glaszmann 
et al. 1989), agro-morphological traits (Borkakati et al. 2000) 
and molecular markers including RAPD (Sarma and 
Bahar 2005) and ISSR (Bhuyan et al. 2007). The high 
genetic diversity among rice varieties in the NE Indian 
region could be attributable to combined effect of wide 
eco-geographical conditions, diverse agro-ecosystems 
associated with various rice farming practices and di- 
verse human cultural preferences. High genetic diversity 
is also reported for other crop plants such as Zingiber 
officinale (Sajeev et al. 2011), Chilli (Yumnam et al. 
2012), Curcuma species (Das et al. 2011), Citrus species 



(Hazarika 2012) commonly cultivated in NE India, 
highlighting the importance of the region for germ- 
plasm conservation of many crop plants. 

We compared the levels of genetic diversity among 
different types of rice cultivated in NE India, and found 
that sali type possessed the highest gene diversity value 
of 0.747 and average allelic richness of 7.585. The major- 
ity of sali varieties are maintained by traditional farmers 
for specific traits such as aroma, grain size and shape, 
and tolerance to drought, insects and pests, which may 
contribute to the maintenance of high genetic variation. 
Jum type also showed high level of heterogeneity with 
gene diversity of 0.627 and average allelic richness of 
5.056. The traditional farming systems and local envir- 
onment associated with adaptation to diverse conditions 
including water deficient habitats on the slopes of hilly 
regions may have contributed to the maintenance of 
high genetic variability among the jum type. Due to their 
inherent high genetic diversity, sali and jum types should 
be prioritized in conservation and management plans and 
future breeding programs. 

The high F IS values among rice varieties of the region 
could be due to predominantly selfing breeding system with 
a very low outcrossing in O. sativa species (Oka 1988). The 
Fst results (Table 3) are also supported by AMOVA 
(Table 4) which indicated that 66% of the total vari- 
ation was due to differentiation among varieties. This 
indicates that rice varieties of the Eastern Himalayan 
region are highly differentiated. 

Population structure 

The UPGMA analyses using genetic distance data clus- 
tered rice varieties into two groups, which corresponded 
to O. sativa sub-species indica and japonica (Glaszmann 
1987; Oka 1988; Khush 1997). These results agree with 
the previous isozyme data based finding that showed 
the occurrence of two major groups of rice varieties 
in NE India (Glaszmann 1987). The PCA analysis and 
Model-based clustering method implemented in the 
STRUCTURE software also suggested the existence of 
two major groups corresponding to indica and japo- 
nica sub-species. The majority of varieties including 
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agronomically improved rice varieties clustered as one 
group within the sub-species indica. Most of the varieties 
were grouped into indica sub-species cluster while few 
varieties clustered into japonica sub-species. Vairavan 
et al. (1973) also reported similar results on the basis of 
amylose content, agronomic, and morphological charac- 
teristics. Our findings were similar to the study involv- 
ing Indonesian landraces where 68% of the varieties 
were assigned as indica and 32% as japonica (Thomson 
et al. 2007). However, a study of European rice collec- 
tion revealed that 89% of the accessions belonged to 
japonica type (Courtois et al. 2012). The O. rufipogon 
showed intermediate position between indica and japonica 
types suggesting a possible common ancestry of both indica 
and japonica types. 

Although there was no clear differentiation among jum, 
sali, bow, and glutinous varieties in the UPGMA and 
STRUCTURE analysis, the PCA analysis separated the 
agronomically improved varieties into a distinct group 
(Figure 3) closely associated with the indica type. This is 
expected as agronomically improved varieties included in 
the present study were derived from indica type. The 
STRUCTURE analysis did not show evidence for ad- 
mixture between the indica and japonica types in almost 
all varieties. This could be attributable to predominantly 
selfing or autogamous nature of the breeding system and 
associated restricted gene flow among populations. Only a 
few varieties showed mixed ancestry of indica and japo- 
nica type (Figure 5), which may be either due to partial 
differentiation or rare introgression between the two 
types. Similar structuring reported among Asian culti- 
vated rice Oryza sativa could be due to partial shar- 
ing of their ancestral genetic polymorphism and/or 
recent gene flow (Gao and Innan 2008). 

Glaszmann et al. (1989) identified seven groups using 
isozyme markers and reported typical indica and japo- 
nica sub-species, suggesting that varieties mostly grown 
in mountainous areas of Meghalaya and Arunachal 
Pradesh belong to japonica. However, the present study 
revealed that varieties in the mountainous areas of 
Meghalaya and Arunachal Pradesh represent both japon- 
ica and indica types. Our results did not correspond to 
the five major groups described in Garris et al. (2005). 

Conclusions 

In summary, high genetic diversity detected among trad- 
itional rice varieties in the Eastern Himalayan region of 
NE India is comparable to genetic diversity detected in 
wild rice populations in various parts of the world. Se- 
veral varieties with high genetic diversity and cultural 
importance were found in Barak Valley region of Assam 
and Garo Hills of Meghalaya. The sali and jum type sho- 
wed significantly higher levels of genetic diversity com- 
pared to agronomically improved types. Rice varieties in 



NE India clustered into two major groups corresponding 
to two sub-species, namely indica and japonica. Our 
findings highlights the importance of conservation of 
rice varieties in NE India as a means of preserving gen- 
etic diversity to maintain food security in the changing 
world. 

Additional files 



Additional file 1: Table SI. Locus wise allele frequencies of each 
variety based on 10 genotyped individuals. 

Additional file 2: Table S2. Pairwise F ST values among different rice 
varieties of eastern himalayan region in Northeast India and wild rice 
(0. rufipogon). 

Additional file 3: Figure SI. Sub-groups of rice varieties within group-l 
(indica) and group-ll (japonica) based on cultivation type, grain 
characteristics and geographic origin. Figure S2. STRUCTURE output (a) 
including agronomically improved varieties and (b) without agronomically 
improved varieties. Note that three varieties (Kawanglawang, Local Basmati 
and Bashful; 3, 6, and 18 marked with asterisk) interchanged between 
group-l (indica) and group-ll (japonica) groups in (a) while all varieties of 
group-l (indica) and group-ll (japonica) found in UPGMA and PCA analysis 
clustered together in (b). 
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